Using Information Extraction to Build a Directory of Conference Announcements

نویسنده

  • Karl-Michael Schneider
چکیده

We describe an application of information extraction for building a directory of announcements of scientific conferences. We employ a cascaded finite-state transducer to identify possible conference names, titles, dates, locations and URLs in a conference announcement. In order to cope with agrammatical text that is typical for conference announcements, our system uses orthographic features of the text and a domain-specific tag set, rather than general purpose part-of-speech tags. Extraction accuracy is improved by recognizing other entities in the text that are not extracted but could be confused with slot values. A scoring scheme based on some simple heuristics is used to select among multiple extraction candidates. We also present an evaluation of our system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extracting Information from Conference Announcements: High Recall, High Precision

Recall, High Precision Kevin Cheong Language Technology Group, Microsoft Research Institute School of MPCE, Macquarie University Sydney NSW 2109, Australia [email protected] Abstract Conference announcements are distributed widely each day via electronic mail to the research and industrial community. These conferences inform researchers, academics and the industry about the research and dev...

متن کامل

Towards Multicast Session Directory Services

The current increase in interest in Internet Multicast and Multimedia Applications is self-evident and has profited from such infrastructures as the Multicast Backbone (Mbone) and its applications and tools which are several years old now. However, an ad-hoc usage of some of this technology’s features, namely, session announcements and multicast addresses, do not scale easily. The Session Descr...

متن کامل

AIMS Embedded Systems Programming

Open the file Makefile. This file contains all information necessary to build the project. Makefiles define build goals, which can be run using the command make. The default build goal is called all. make supports many helpful macros which simplify the build definition. $(wildcard *.c) defines a list of all .c files in the current directory. $(addprefix build/,$(SRC:.c=.o)) adds a prefix to eac...

متن کامل

.The effect of information resources on the selection of strategies for adaptation to climate change by farmers (Case study: Golestan Province)

Background and Aim: The use of information resources is one of the important strategies in the selection of adaptation strategies to climate change by farmers. The aim of this study was to determine the effect of information resources on the selection of adaptation strategies to climate change by farmers in Golestan province. Method: The research was descriptive and survey. The statistical popu...

متن کامل

IJCAI - 97 Wrapper Induction for Information Extraction

Many Internet information resources present relational data|telephone directories, product catalogs, etc. Because these sites are formatted for people, mechanically extracting their content is di cult. Systems using such resources typically use hand-coded wrappers, procedures to extract data from information resources. We introduce wrapper induction, a method for automatically constructing wrap...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004